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DETAILED ACTION 
Response to Amendment 

1. In response to the Office Action mailed December 9, 2005, applicant submitted 
an amendment filed on March 9, 2006, in which the applicant traversed and requested 
reconsideration with respect to claims 1, 8 and 15-18. 

Response to Arguments 

2. Applicants argue Keiller does not disclose or suggest storage means for string a 
recording character string indicating a sentence to be recorded. Rather the description 
of Keiller refers to an operation of an apparatus after it has been trained, i.e. refers to 
operation of an apparatus as a speech recognition apparatus to perform speech 
recognition on unknown speech. In response to applicant's argument that the 
references fail to show certain features of applicant's invention, it is noted that the 
features upon which applicant relies (i.e., operation of an apparatus after it has been 
trained) are not recited in the rejected claim(s). Although the claims are interpreted in 
light of the specification, limitations from the specification are not read into the claims. 
See In re Van Geuns, 988 F.2d 1 1 81 , 26 USPQ2d 1 057 (Fed. Cir. 1 993). 

Applicants also argues that Kieller does not compare a pattern of recognized 
character string (corresponding to the stored recording character string) with a pattern 
of the recording character string stored on the storage means so as to obtain a 
matching rate therebetween. The recognition performed in Keiller is used in speech 
recognition, not in enrollment for the acoustic model, response to applicant's argument 
that the references fail to show certain features of applicant's invention, it is noted that 
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the features upon which applicant relies (i.e., enrollment for the acoustic model) are not 
recited in the rejected claim(s). Although the claims are interpreted in light of the 
specification, limitations from the specification are not read into the claims. See In re 
Van Geuns, 988 F.2d 1181, 26 USPQ2d 1057 (Fed. Cir. 1993). 

Applicants' arguments have been considered, but are not persuasive. 



Claim Rejections - 35 (JSC § 102 

3. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 



4. Claims 1-3, 8-10 and 15-18 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Keiller (USPN 6,560,575). 

Regarding claims 1, 8, and 15, Keiller discloses an apparatus, method and 
system for recording speech, to be used as learning data for recognizing input speech, 
comprising: 

storage means for storing a recording character string indicating a sentence to be 
recorded (column 16, lines 12-19); 



Application/Control Number: 09/976,098 Page 4 

Art Unit: 2626 

recognition means for recognizing input speech for use as the learning data so 
as to obtain a recognized character string (input is taken as two training examples: one 
a new example and one an already existing example; column 15, lines 25-35) 
corresponding to the stored recording character string pattern (column 16, lines 16-19); 

determination means for comparing a pattern of the recognized character string 
with a pattern if the recording character string stored in said storage means so as to 
obtain a matching rate therebetween, and determining whether said matching rate 
exceeds a predetermined level (system checks whether training examples are 
consistent (column 15, lines 28-30) by computing the consistency scores (column 15, 
lines 53-65) and comparing the result again against the threshold (95%, column 16, 
lines 6-8)); and 

recording means for recording the input speech as the learning data for 
recognizing speech when it is determined by said determination means that said 
matching rate exceeds a predetermined level (if the results are consistent, they are 
used to generate a model for word being trained (column 15, lines 27-30), so inherently, 
the generated model is stored (recorded) to some memory means (see also column 16, 
lines 12-15)). 

Regarding claims 2 and 9, Keiller discloses an apparatus and method further 
comprising re-input instruction means for issuing an instruction to input speech once 
again when it is determined by said determination means for issuing an instruction to 
input speech once again when it is determined by said determination means that said 
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matching rate does not exceed the predetermined level (if words do not match, the 
system required a new example; column 15, lines 32-35). 

Regarding claims 3 and 10, Keiller discloses an apparatus and method wherein 
said determination means determines said matching rate by performing DP matching 
between the recognized character string pattern and the recording character string 
pattern (dynamic programming is used to determine whether the inputted words are 
consistent with each other; column 14, lines 48-55 with column 15, lines 54-65). 

Regarding claim 16, Keiller discloses a speech recognition method comprising: 
a learning recognition step of recognizing input speech, to be used as learning 
data, so as to obtain a recognized character string (input is taken as two training 
examples: one a new example and one an already existing example; column 15, lines 
25-35); 

a determination step of comparing a pattern of the recognized character string 
with a pattern of a recording character string indicating a sentence to be recorded so as 
to obtain a matching rate therebetween, and of determining whether said matching rate 
exceeds a predetermined level (system checks whether training examples are 
consistent (column 15, lines 28-30) by computing consistency scored (column 15, lines 
53-65) and comparing the result against a threshold (95%, column 16, lines 6-8)); 

a recording step of recording the input speech as the learning data for 
recognizing speech when it is determined in said determination step that said matching 
rate exceeds a predetermined level (if results are consistent, they are used to generate 
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a model for word being trained (column 15, lines 27-30), so inherently, the generated 
model is stored (recorded) to a memory means (column 16, lines 12-19)); 

a learning step of performing learning on a speech model by using the input 
speech recorded in said recording step (the process described above provides general 
training of the model; column 16, lines 14-20); and 

a recognition step of recognizing unknown input speech by using the speech 
model learned in said learning step (training data is used in general recognition; column 
16, lines 14-20). 

Regarding claims 17 and 18, Keiller discloses a control program having 
computer readable program code and a speech recognition method, comprising: 

a first program code unit for recognizing input speech used as the learning data 
so as to obtain a recognized character string pattern (input is taken as two training 
examples: one a new example and one an already existing example; column 15, lines 
25-35); 

a second program code unit for comparing a pattern of the recognized character 
string with a pattern of recording character string indicating a sentence to be recorded 
so as to obtain a matching rate therebetween, and of determining whether said 
matching rate exceeds a predetermined level system checks whether training examples 
are consistent (column 15, lines 28-30) by computing consistency scored (column 15, 
lines 53-65) and comparing the result against a threshold (95%, column 16, lines 6-8); 

a third program code unit for recording the input speech as the learning data for 
recognizing speech when it is determined in said determination step that said matching 
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rate exceeds a predetermined level (if results are consistent, they are used to generate 
a model for word being trained (column 15, lines 27-30), so inherently, the generated 
model is stored (recorded) to a memory means (column 16, lines 12-19)); 

a fourth program code unit for performing learning on a speech model by using 
the input speech recorded in said record step (the process described above provides 
general training of the model; column 16, lines 14-20); and 

a fifth program code unit for recognizing unknown input speech by using the 
speech model learned in said learning step (training data is used in general recognition; 
column 16, lines 14-20). 

Claim Rejections - 35 (JSC § 103 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

6. Claims 4-7 and 11-14 rejected under 35 U.S.C. 103(a) as being unpatentable 
over Keiller in view of Yu (USPN 6,556,841 ). 

Regarding claims 4 and 11, Keiller discloses an apparatus and method for 
recording speech, to be used as learning data in speech recognition processing, but 
lacks further comprising presentation means for presenting an unmatched portion 
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between the recognized character string pattern and the recording character string 
pattern to a user as a result of performing the DP matching by said determination 
means. 

Yu discloses a speech correction device further comprising presentation means 
for presenting an unmatched portion (no substantial match) between the recognized 
character string pattern (character strings) and the recording character string pattern to 
a user as a result of performing the DP matching by said determination means (plurality 
if character strings stored in a dictionary; column 5, lines 15-22), for implementing spell 
checking and correcting applications. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Keiller's apparatus and method further 
comprising presentation means for presenting an unmatched portion between the 
recognized character string pattern and the recording character string pattern to a user 
as a result of performing the DP matching by said determination means, to identify 
unrecognized character strings (e.g. misspelled words) and to provide a notification or 
indication (column 6, lines 26-28). 

Regarding claims 5 and 12, Keiller discloses an apparatus and method for 
recording speech, to be used as learning data in speech recognition processing, but 
lacks wherein said presentation means presents the unmatched portion so as to identify 
the type of error as an insertion error, a missing error, or a substitute error, as a result of 
performing the DP matching by said determination means. 



Application/Control Number: 09/976,098 Page 9 

Art Unit: 2626 

Yu discloses a speech correction device wherein said presentation means 
presents the unmatched portion so as to identify the type of error (column 5, line 35) as 
an insertion error (adding the character input; column 3, lines 46-52), a missing error (? 
indicates that variations due; column 8, lines 44-47), or a substitute error (character 
substitution; column 8, lines 3-29 with column 7, lines 41-42 and lines 61-66), as a 
result of performing the DP matching by said determination means (column 7, lines 11- 
42), for notification and identification of unrecognized words. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Keiller's apparatus and method wherein said 
presentation means presents the unmatched portion so as to identify the type of error 
as an insertion error, a missing error, or a substitute error, as a result of performing the 
DP matching by said determination means, to identify unrecognized character strings 
(e.g. misspelled words) and to provide a notification or indication (column 6, lines 26- 
28). 

Regarding claims 6 and 13, Keiller discloses an apparatus and method for 
recording speech, to be used as learning data in speech recognition processing, but 
lacks wherein said presentation means simultaneously displays the recognized 
character string and the recording character string on a screen by changing a character 
attribute or a background attribute of an unmatched portion or a matched portion of at 
least one of the recognized character string and the recording character string. 

Yu discloses a speech correction device wherein said presentation means 
simultaneously displays the recognized character string and the recording character 
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string on a screen by changing a character attribute or a background attribute of an 
unmatched portion or a matched portion of at least one of the recognized character 
string and the recording character string, (underlined, bold, italics etc.; column 6, lines 
8-18), for notification and identification of unrecognized words. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Keiller's apparatus and method wherein said 
presentation means simultaneously displays the recognized character string and the 
recording character string on a screen by changing a character attribute or a 
background attribute of an unmatched portion or a matched portion of at least one of the 
recognized character string and the recording character string, to identify unrecognized 
character strings (e.g. misspelled words) and to provide a notification or indication 
(column 6, lines 26-28). 

Regarding claims 7 and 14, Keiller discloses an apparatus and method for 
recording speech, to be used as learning data in speech recognition processing, but 
lacks wherein said presentation means simultaneously displays the recognized 
character string and the recording character string on a screen by causing unmatched 
portion or matched portion of at least one recognized character string and the recording 
character string to blink (flashing; column 6, lines 8-18), for notification and identification 
of unrecognized words. 

Yu discloses a speech correction device wherein said presentation means 
simultaneously displays the recognized character string and the recording character 
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string on a screen by causing unmatched portion or matched portion of at least one 
recognized character string and the recording character string to blink. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Keiller's apparatus and method wherein said 
presentation means simultaneously displays the recognized character string and the 
recording character string on a screen by causing unmatched portion or matched 
portion of at least one recognized character string and the recording character string to 
blink, to identify unrecognized character strings (e.g. misspelled words) and to provide a 
notification or indication (column 6, lines 26-28). 

7. Claims 19-22 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Keiller in view of Koizumi et al. (PGPUB 2005/0131673), hereinafter referenced as 
Koizumi. 

Regarding claims 19-22, Keiller discloses an apparatus and method for 
recording speech, to be used as learning data in speech recognition processing, but 
does not specifically teach a display control means for controlling displaying of the 
character string indicating the sentence to be recorded, wherein said recognition means 
recognizes input speech of the displayed sentence that the user reads out. 

Koizumi discloses a speech device that comprises a display control means for 
controlling displaying of the character string indicating the sentence to be recorded, 
wherein said recognition means recognizes input speech of the displayed sentence that 
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the user reads out (speaks a sentence while viewing the sentence display screen; 
column 5, paragraph 0053), to retrieve and display sentence templates. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Keiller's apparatus and method wherein it 
teaches recognizing input speech of the displayed sentence that the user reads out, as 
taught by Koizumi, so aloe the user to be free from troublesome manipulation for 
replacing a word (abstract). 



Conclusion 

8. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 
§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 
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9. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Jakieda R. Jackson whose telephone number is 
571 .272.761 9. The examiner can normally be reached on Monday through Friday from 
7:30 a.m. to 5:00p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on 571.272.7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



JRJ 

March 21, 2006 
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